Representation of Pitch in Population-wide Interspike Interval Statistics
نویسنده
چکیده
Temporal coding of sensory information can arise through exogenous, stimulus-locked temporal patternings of spikes or through endogenous, stimulus-triggered temporal patterns of response [1, 2]. Phase-locking to stimulus waveforms permits strategies for stimulus localization that are based on temporal cross-correlation across sensory receptors and strategies for representation of form that are based on temporal autocorrelation within sensory channels. Phase-locked neural responses can be found in a variety of sensory modalities: audition, somatoception, vision (Fig. 1), proprioception (stretch receptors), and electroception. Figure 1. Phase-locking in a visual neuron. Responses of a typical lateral geniculate unit to a sinusoidal grating drifting at constant velocity. Period histograms (A,B) and all-order interval histograms (C,D) to resulting temporal modulations at 16 and 32 Hz. Spatial frequency rate tuning curve (E). Data courtesy of A. Prezybyszewski and D. Pollen [3]. In audition there is strong evidence that both localization and pitch perception utilize fine timing information provided by the phase-locking of auditory nerve fibers to acoustic stimuli. Our experimental work has focused on the early representation of pitch in the auditory pathway [46]. The population-wide distribution of all-order interspike intervals at the level of the auditory nerve (Fig. 2) forms an autocorrelation-like temporal representation of the stimulus whose properties explain many diverse aspects of pitch perception (e.g. missing fundamentals, levelinvariance, pitch equivalence, octave similarity, pitch shifts of inharmonic complex tones) (see also [7-10]). Population-interval distributions (PIDs) also provide robust representations for those aspects of timbre that are related to stationary power spectra (e.g. formant structure and vowel quality). PID-based models of masking and harmonic resolvability that are based on the competition of interval patterns integrate this information across cochlear territories in a manner that reflects cochlear excitation patterns. Spike timing information capable of supporting PIDbased representations is readily available in auditory brainstem, although this information becomes steadily sparser and less abundant as one ascends the pathway beyond the midbrain. A central question for auditory neurophysiology concerns the means by which the central auditory system might make use of such timing information. The bandpass modulation tunings that have been observed in the auditory pathway [11] fail to account for the level-invariant nature of pitch and pitch shifts of inharmonic tones. Thus far, except for a few possible exceptions, central pitch detectors that could account for pitch equivalences between pure tones and harmonic complexes have not been found. It is therefore difficult to envision where timerate coding transformations and periodicity detections would exist in the ascending auditory pathway. Two possible alternative working hypotheses involve 1) covert sparse-temporal coding (based on intervals or synchrony) that is distributed across neurons, 2) top-down/bottom processing strategies in which up central auditory stations organize lower level processing and access lower level pitch representations when needed. Figure 2. Population-interval representation of the low pitch of a complex tone in the auditory nerve. Auditory nerve responses to a single-formant vowel. A. Vowel waveform. A strong, low voice pitch is heard at the fundamental period of the vowel (F0=80 Hz, 1/F0=12.5 ms). B. Peristimulus time histograms of 51 cat auditory nerve fibers to 100 stimulus presentations at 60 dB SPL. Histogram baselines indicate fiber CF's. C. Power spectrum of the vowel (0.1-10kHz, log scale). D. Driven discharge rates as a function of CF. E. Stimulus autocorrelation function. F. Population-interval distribution formed by summing all-order intervals from all fibers. As a consequence of these theoretical difficulties, we have explored alternative, time-domain processing strategies for utilizing stimulus-driven fine timing information [12, 13]. In the spirit of Licklider’s duplex time-delay neural network model [8], they are meant as heuristics that illustrate functional principles and possibilities rather than descriptions of the input-output behavior of particular neuronal populations. A principal advantage of using temporally-coded pulse patterns rather than the rate-place patterns of classical connectionist networks is that temporally-coded signals can be liberated from their dependence on particular transmission lines and processing elements. Ultimately we seek neural networks for temporal coding and processing that are capable of mass action, multiplexing and de-multiplexing of signals, signal elaboration, broadcast of information, asynchronous-heterarchical rather than sequentialhierarchical organization, content-addressable memory, nonlocal storage of memory, compositionality, and support for multiple logical types. Thus far, only the functional capabilities of simple feedforward and recurrent timing nets have been explored. Figure 3. Feedforward timing nets. Basic computational operations. Feedforward neural timing nets (Fig. 3) are coincidence arrays that function as temporal pattern sieves to pass intervals and interval-sequences that are shared across inputs. As a result, these simple mechanisms are potentially useful for demultiplexing embedded spike patterns. Such nets multiply the autocorrelations of their input spike trains, and thus can be used to extract intervals related to a common fundamental period (pitch) shared by two stimuli even when the stimuli have no harmonics in common (different timbres), and vice versa. Recurrent timing nets are coincidence arrays with delay loops that build up and separate auditory objects with different fundamentals (e.g. different voices, instruments) on the basis of temporal pattern invariances. These recurrent nets suggest alternative correlation-based strategies for scene analysis that are based on temporal pattern coherence rather than on feature detection and binding.
منابع مشابه
The representation of periodic sounds in simulated sustained chopper units of the ventral cochlear nucleus.
The nature of the neural processing underlying the extraction of pitch information from harmonic complex sounds is still unclear. Electrophysiological studies in the auditory nerve and many psychophysical and modeling studies suggest that pitch might be extracted successfully by applying a mechanism like autocorrelation to the temporal discharge patterns of auditory-nerve fibers. The current mo...
متن کاملTemporal Coding of Periodicity Pitch in the Auditory System: An Overview
This paper outlines a taxonomy of neural pulse codes and reviews neurophysiological evidence for interspike interval-based representations for pitch and timbre in the auditory nerve and cochlear nucleus. Neural pulse codes can be divided into channel-based codes, temporal-pattern codes, and time-of-arrival codes. Timings of discharges in auditory nerve fibers reflect the time structure of acous...
متن کاملSpatio-temporal representation of the pitch of complex tones in the auditory nerve
Although pitch is a fundamental auditory percept that plays an important role in music, speech, and auditory scene analysis, the neural codes and mechanisms for pitch perception are still poorly understood. In a previous study (Cedolin and Delgutte 2005), we tested the effectiveness of two classic representations for the pitch of harmonic complex tones at the level of the auditory nerve (AN) in...
متن کاملNeural correlates of the pitch of complex tones. I. Pitch and pitch salience.
1. The temporal discharge patterns of auditory nerve fibers in Dial-anesthetized cats were studied in response to periodic complex acoustic waveforms that evoke pitches at their fundamental frequencies. Single-formant vowels, amplitude-modulated (AM) and quasi-frequency-modulated tones. AM noise, click trains, and other complex tones were utilized. Distributions of intervals between successive ...
متن کاملEncoding of pitch in the human brainstem is sensitive to language experience.
Neural processes underlying pitch perception at the level of the cerebral cortex are influenced by language experience. We investigated whether early, pre-attentive stages of pitch processing at the level of the human brainstem may also be influenced by language experience. The human frequency following response (FFR), reflecting sustained phase-locked activity in a population of neural element...
متن کاملPitch representations in the auditory nerve: two concurrent complex tones.
Pitch differences between concurrent sounds are important cues used in auditory scene analysis and also play a major role in music perception. To investigate the neural codes underlying these perceptual abilities, we recorded from single fibers in the cat auditory nerve in response to two concurrent harmonic complex tones with missing fundamentals and equal-amplitude harmonics. We investigated ...
متن کامل